Word Sense Disambiguation

ثبت نشده
چکیده

A word may have more than one meaning or sense (polysemous). For instance, the word “light” may have a meaning of “a source of lighting or illumination” as in “Turn off the light”. It can also have a meaning of “not heavy” as in “My new phone is small and light”. The meaning can usually be differentiated or disambiguated by observing the context where the word is used. In a written text or sentence, the context can be described in the simplest case as the information of the surrounding words. In a more complex case, as in the use of word bank in “I’ll meet you at the bank”, the disambiguation requires more than just the information from the surrounding words. Since, the bank may have a meaning of “a financial institution (a building)” but can also have a meaning of “river bank”. Both meanings are possible and the correct meaning will depend on the shared knowledge between the speakers. Word sense disambiguation is the task to decide, given a text or sentence, the correct meaning of a certain word inside the text. In this project, we experimented in disambiguating the word senses of three different words: hard, line, and serve. We use the data developed by (Leacock). The data contains sentences with the target word (word to be disambiguated) tagged and disambiguated using the senses from WordNet (Fellbaum). There will be one word that we need to disambiguate in each sentence and hence, the meaning of the word will be disambiguated using this one sentence context. We will use three different machine learning methods to do the classification of the word into their respective sense. The performance of each method will be measured by its accuracy, i.e. the number of sentences with correct prediction divided by the total number of sentences predicted. We will describe the data and the features we use in Section 2. Section 3 will provide theoretical background on the selected machine learning methods. In Section 4, we will describe our experiments and our analysis of the results. The conclusions of the work will be given in Section 5.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

رفع ابهام معنایی واژگان مبهم فارسی با مدل موضوعی LDA

Word sense disambiguation is the task of identifying the correct sense for the word in a given context among a finite set of possible sense. In this paper a model for farsi word sense disambiguation is presented. The model use two group of features: first, all word and stop words around target word and topic models as second features. We extract topics from a farsi corpus with Latent Dirichlet ...

متن کامل

Word Domain Disambiguation via Word Sense Disambiguation

Word subject domains have been widely used to improve the performance of word sense disambiguation algorithms. However, comparatively little effort has been devoted so far to the disambiguation of word subject domains. The few existing approaches have focused on the development of algorithms specific to word domain disambiguation. In this paper we explore an alternative approach where word doma...

متن کامل

A Review Of Literature On Word Sense Disambiguation

lexical ambiguity is a fundamental characteristic of language. Words can have more than one distinct meaning. Word sense disambiguation is defined as the problem of computationally determining which”sense”of a word is correct in given context. Word sense disambiguation is a task of classification where word senses are the classes, the context provides the evidence, and each occurrence of a word...

متن کامل

Optimizing Classifier Performance in Word Sense Disambiguation by Redefining Sense Classes

Learning word sense classes has been shown to be useful in fine-grained word sense disambiguation [Kohomban and Lee, 2005]. However, the common choice for sense classes, WordNet lexicographer files, are not designed for machine learning based word sense disambiguation. In this work, we explore the use of clustering techniques in an effort to construct sense classes that are more suitable for wo...

متن کامل

Improving Japanese Zero Pronoun Resolution by Global Word Sense Disambiguation

This paper proposes unsupervised word sense disambiguation based on automatically constructed case frames and its incorporation into our zero pronoun resolution system. The word sense disambiguation is applied to verbs and nouns. We consider that case frames define verb senses and semantic features in a thesaurus define noun senses, respectively, and perform sense disambiguation by selecting th...

متن کامل

How Phrase Sense Disambiguation outperforms Word Sense Disambiguation for Statistical Machine Translation

We present comparative empirical evidence arguing that a generalized phrase sense disambiguation approach better improves statistical machine translation than ordinary word sense disambiguation, along with a data analysis suggesting the reasons for this. Standalone word sense disambiguation, as exemplified by the Senseval series of evaluations, typically defines the target of disambiguation as ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013